switchBox: An R package for k-Top Scoring Pairs (kTSP) classifier development

نویسندگان

  • Bahman Afsari
  • Elana J. Fertig
  • Donald Geman
  • Luigi Marchionni
چکیده

Summary: k-Top scoring pairs (kTSP) is a classification method for prediction from high throughput data based on a set of the paired measurements. Each of the two possible orderings of a pair of measurements (e.g., a reversal in the expression of two genes) is associated with one of two classes. The kTSP prediction rule is the aggregation of voting among such individual two-feature decision rules based on order switching. kTSP, like its predecessor, TSP, is a parameter-free classifier relying only on ranking of a small subset of features, rendering it robust to noise and potentially easy to interpret in biological terms. In contrast to TSP, kTSP has comparable accuracy to standard genomics classification techniques, including Support Vector Machines (SVM) and Prediction Analysis for Microarrays (PAM). Here, we describe “switchBox,” an R package for kTSP-based prediction. Availability: The “switchBox” package is freely available from Bioconductor: http://www.bioconductor.org Contact: [email protected]

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

switchBox: an R package for k-Top Scoring Pairs classifier development

UNLABELLED k-Top Scoring Pairs (kTSP) is a classification method for prediction from high-throughput data based on a set of the paired measurements. Each of the two possible orderings of a pair of measurements (e.g. a reversal in the expression of two genes) is associated with one of two classes. The kTSP prediction rule is the aggregation of voting among such individual two-feature decision ru...

متن کامل

Rgtsp: a generalized top scoring pairs package for class prediction

SUMMARY A top scoring pair (TSP) classifier consists of a pair of variables whose relative ordering can be used for accurately predicting the class label of a sample. This classification rule has the advantage of being easily interpretable and more robust against technical variations in data, as those due to different microarray platforms. Here we describe a parallel implementation of this clas...

متن کامل

Bioconductor’s tspair package

The tspair package contains functions for calculating the top scoring pair for classification of high-dimensional data sets [1]. A top scoring pair is a pair of genes whose relative ranks can be used to classify arrays according to a binary phenotype. A top scoring pair classifier has three advantages over standard classifiers: (1) the classifier is based on the relative ranks of genes and is m...

متن کامل

The tspair package for finding top scoring pair classifiers in R

UNLABELLED Top scoring pairs (TSPs) are pairs of genes whose relative rankings can be used to accurately classify individuals into one of two classes. TSPs have two main advantages over many standard classifiers used in gene expression studies: (i) a TSP is based on only two genes, which leads to easily interpretable and inexpensive diagnostic tests and (ii) TSP classifiers are based on gene ra...

متن کامل

A Generic Framework for Top-k Pairs and Top-k Objects Queries over Sliding Windows

Top-k pairs and top-k objects queries have received significant attention by the research community. In this paper, we present the first approach to answer a broad class of top-k pairs and top-k objects queries over sliding windows. Our framework handles multiple top-k queries and each query is allowed to use a different scoring function, a different value of k and a different size of the slidi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014